Hazard models are powerful distress models.
Popular since Shumway (2001).
\(y_{it}\)s are binary distress indicator, \(\vec x_{it}\)s are firm covariates, and \(\vec{z}_t\)s are macro variables.
E.g., see Chava and Jarrow (2004), Beaver, McNichols, and Rhie (2005), and Campbell and Szilagyi (2008).
Cannot be captured by firm specific covariates, \(\vec x_{it}\), and macro covariates, \(\vec z_t\). Major issue for portfolio risk.
Extend to include latent effects
\[ \begin{align*} g(E(y_{it}|\vec x_{it}, \vec z_t, \vec u_t, \vec o_{it})) &= \vec\beta^\top\vec x_{it}+ \vec \gamma^\top \vec z_t + \vec u_t^\top \vec o_{it} \\ \vec u_t &\sim p_{\vec \theta}(\vec u_{t-1}) \end{align*} \]
Pioneered by Duffie et al. (2009).
E.g., see Koopman, Lucas, and Schwaab (2011), Hwang (2012), Qi, Zhang, and Zhao (2014), Nickerson and Griffin (2017), Azizpour, Giesecke, and Schwenkler (2018), and Kwon and Lee (2018).
Although an OU process is a reasonable starting model for the frailty process, one could allow much richer frailty models. …, however, we have found that even our relatively large data set is too limited to identify much of the time-series properties of frailty. … For the same reason, we have not attempted to identify sector-specific frailty effects.
Duffie et al. (2009)
Data set.
Models without frailty.
Models with frailty.
Merge CRSP and Compustat.
Merge with Moody’s Analytics Default & Recovery Database.
At risk: have not had a distress, is after resolution date, or 12 months after distress if resolution date is missing.
Events are alternative dispute resolution, conservatorship, cross default, loan forgiven, chapter 7, placed under administration, seized by regulators, deposit freeze, suspension of payments, grace period default, payment moratorium, prepackaged chapter 11, indenture modified, bankruptcy, receivership, missed principal payment, missed principal and interest payments, distressed exchange, chapter 11, and missed interest payment.
Missed interest payment is by far the most common.
Matching to Compustat is done with CUSIPs, ticker symbols, and company names.
Use quatlery data and impute yearly data when missing.
Data is lagged with 3 months and carried forward for up to 1 year.
Used to compute past 1 year excess log return.
Used to compute log relative market size.
Data is lagged with 1 month and carried forward for up to 3 months.
Very good predictor (Bharath and Shumway 2008).
Require 1 year initial data and at least 3 months of data in each window.
siccd if available. Otherwise use CRSP sich.
Final samples has 624,692 firm-months, 4,433 firms and 721 events.
Included variables are
All winsorized at 1% and 99%.
| Df | AIC | |
|---|---|---|
| First model | 12 | 7612.93 |
| New denominator | 12 | 7559.93 |
| Add distance-to-default | 13 | 7469.90 |
| Add macro variables | 15 | 7447.24 |
| Simplify model | 11 | 7441.04 |
| Df | AIC | |
|---|---|---|
| First model | 12 | 7612.93 |
| New denominator | 12 | 7559.93 |
| Add distance-to-default | 13 | 7469.90 |
| Add macro variables | 15 | 7447.24 |
| Simplify model | 11 | 7441.04 |
Issue with small denominators as in Campbell and Szilagyi (2008). Change denominator to 50% total assets and 50% market value.
| Df | AIC | |
|---|---|---|
| First model | 12 | 7612.93 |
| New denominator | 12 | 7559.93 |
| Add distance-to-default | 13 | 7469.90 |
| Add macro variables | 15 | 7447.24 |
| Simplify model | 11 | 7441.04 |
| Df | AIC | |
|---|---|---|
| First model | 12 | 7612.93 |
| New denominator | 12 | 7559.93 |
| Add distance-to-default | 13 | 7469.90 |
| Add macro variables | 15 | 7447.24 |
| Simplify model | 11 | 7441.04 |
Past 1 year market log return and 1 year T-bill rate as in Duffie et al. (2009) and other papers .
| Df | AIC | |
|---|---|---|
| First model | 12 | 7612.93 |
| New denominator | 12 | 7559.93 |
| Add distance-to-default | 13 | 7469.90 |
| Add macro variables | 15 | 7447.24 |
| Simplify model | 11 | 7441.04 |
| Estimate | Z-stat | |
|---|---|---|
| Intercept | -8.448 | -17.985 |
| Distance-to-default | -1.929 | -9.034 |
| Log excess return | -0.713 | -18.711 |
| Total liabilities / size | 0.437 | 8.365 |
| T-bill rate | -0.281 | -5.113 |
| Net income / size | -0.110 | -5.650 |
| Relative log market size | -0.107 | -1.702 |
| Current ratio | -0.101 | -1.999 |
| Idiosyncratic volatility | 0.092 | 2.449 |
| Log market return | 0.078 | 2.287 |
| Retained Earnings / size | 0.042 | 1.543 |
Terms that are included in Duffie et al. (2009 table III) have similar sign.
| AIC | Df | LR-stat | P-value | |
|---|---|---|---|---|
| Full model | 7377.98 | |||
| % Relative log market size | 7378.14 | 1 | 2.16 | 0.141196 |
| % Retained Earnings / size | 7378.98 | 1 | 3.00 | 0.083270 |
| % Log market return | 7380.39 | 1 | 4.41 | 0.035750 |
| % Current ratio | 7382.43 | 1 | 6.45 | 0.011094 |
| % Net income / size (spline term) | 7393.20 | 3 | 21.23 | 0.000094 |
| % Idiosyncratic volatility | 7398.73 | 1 | 22.75 | 0.000002 |
| % T-bill rate | 7406.89 | 1 | 30.91 | < 0.000001 |
| % Idiosyncratic volatility (spline term) | 7413.60 | 3 | 41.62 | < 0.000001 |
| % Distance-to-default | 7418.65 | 1 | 42.67 | < 0.000001 |
| % Net income / size | 7423.10 | 1 | 47.12 | < 0.000001 |
| % Total liabilities / size | 7457.40 | 1 | 81.42 | < 0.000001 |
| % Log excess return | 7732.36 | 1 | 356.38 | < 0.000001 |
Rows are models without the given term. The spline terms are orthogonal to the linear term. P-values are based on likelihood ratio test.
Gray area is pointwise 95% confidence intervals. The line is the predicted distress rate. Dots are realised distress rates. Black dots are not covered by confidence intervals.
Model is estimated up to the year prior to the forecasting year.
Starting point is a time-varying intercept as in Duffie et al. (2009).
\[ \begin{align*} g(E(y_{it}|\vec x_{it}, \vec z_t, u_t)) &= \vec\beta^\top\vec x_{it}+ \vec \gamma^\top \vec z_t + u_t \\ u_t &\sim N(\theta u_{t - 1}, \sigma^2) \end{align*} \]
where \(g\) is the inverse complementary-log-log function.
It may be that they actually do everything in continuous time but it does not appear to be the case when you look at Appendix B and C. Moreover, it seems that a Poisson process with a log-offset (the continuous version) is used instead of the inverse complementary-log-log function (the discrete version) in their approximation. It presumably does not matter.
See the R package at CRAN.R-project.org/package=dynamichazard for implementation details.
| Df | AIC | |
|---|---|---|
| Model without frailty | 17 | 7378 |
| Random intercept | 19 | 7345 |
| Random intercept and log relative size | 22 | 7246 |
| Random intercept, log relative size, and industry dummies | 25 | 7241 |
Estimates are \((\hat\theta, \hat\sigma) = (0.8912, 0.1484)\).
Investigate reduced capital requirements of SMBs in Basel II/III as in Jensen, Lando, and Medhat (2017).
Under the IRB approach for corporate credits, banks will be permitted to separately distinguish exposures to SME borrowers (defined as corporate exposures where the reported sales for the consolidated group of which the firm is a part is less than €50 million) from those to large firms. A firm-size adjustment … is made to the corporate risk weight formula for exposures to SME borrowers.
Article 273 in Basel Committee on Banking Supervision (2006).
\[ \begin{align*} g(E(y_{it}|\vec x_{it}, \vec z_t, \vec u_t, o_{it})) &= \vec\beta^\top\vec x_{it}+ \vec \gamma^\top \vec z_t + u_{I,t} +u_{R,t} o_{it} \\ \begin{pmatrix} u_{I,t} \\ u_{R,t} \end{pmatrix} &\sim N\left( \begin{pmatrix} \theta_Iu_{I,t-1} \\ \theta_Ru_{R,t-1} \end{pmatrix}, Q\right) \end{align*} \]
| Df | AIC | |
|---|---|---|
| Model without frailty | 17 | 7378 |
| Random intercept | 19 | 7345 |
| Random intercept and log relative size | 22 | 7246 |
| Random intercept, log relative size, and industry dummies | 25 | 7241 |
Estimates are
\[ \begin{align*} \begin{pmatrix} \hat\theta_I \\ \hat\theta_R \end{pmatrix} &= \begin{pmatrix} 0.9433 \\ 0.9759 \end{pmatrix} & \hat Q&= \hat V\hat C\hat V\\ \hat V&=\begin{pmatrix} 0.2706 & \cdot \\ \cdot & 0.0562 \end{pmatrix} & \hat C&=\begin{pmatrix} 1 & 0.9004 \\ 0.9004 & 1 \end{pmatrix} \end{align*} \]
The fixed slope estimate 0.04133.
Points are means for distressed firms and crosses are overall mean. The
two lines are smoothing splines.
Lando et al. (2013) finds a time-varying log pledgeable assets effects in a non-parametric Aalen model.
Azizpour, Giesecke, and Schwenkler (2018) uses exponentially weighted log past defaulted debt in aggregate distress prediction.
Advocated by Chava and Jarrow (2004) and Chava, Stefanescu, and Turnbull (2011).
No evidence of industry effects in the model without frailty.
Use dummies from Chava and Jarrow (2004):
\[ \begin{align*} g(E(y_{it}|\vec x_{it}, \vec z_t, \vec u_t, \vec o_{it})) &= \vec\beta^\top\vec x_{it}+ \vec \gamma^\top \vec z_t + \vec u_{t}^\top\vec o_{it} \\ \begin{pmatrix} u_{I_1,t} \\u_{I_2,t} \\u_{I_3,t} \\ u_{R,t} \end{pmatrix} &\sim N\left( \begin{pmatrix} \theta_Iu_{I_1,t-1} \\ \theta_Iu_{I_2,t-1} \\ \theta_Iu_{I_3,t-1} \\ \theta_Ru_{R,t-1} \end{pmatrix}, Q\right) \end{align*}, \qquad Q=VCV \]
\[ V=\begin{pmatrix} \sigma_{I_1} & \cdot & \cdot & \cdot \\ \cdot & \sigma_{I_2} & \cdot & \cdot \\ \cdot & \cdot & \sigma_{I_3} & \cdot \\ \cdot & \cdot & \cdot & \sigma_R \end{pmatrix}, \qquad C = \begin{pmatrix} 1 & \rho_I & \rho_I & \rho_{IR} \\ \rho_I & 1 & \rho_I & \rho_{IR} \\ \rho_I & \rho_I & 1 & \rho_{IR} \\ \rho_{IR} & \rho_{IR} & \rho_{IR} & 1 \end{pmatrix} \]
| Df | AIC | |
|---|---|---|
| Model without frailty | 17 | 7378 |
| Random intercept | 19 | 7345 |
| Random intercept and log relative size | 22 | 7246 |
| Random intercept, log relative size, and industry dummies | 25 | 7241 |
\[ \begin{align*} (\hat\theta_I, \hat\theta_R) &= (0.9062, 0.9616) \\ (\hat\sigma_{I_1}, \hat\sigma_{I_2}, \hat\sigma_{I_3},\hat\sigma_R) &= (0.30961, 0.34511, 0.31397, 0.06506) \\ (\hat\rho_I, \hat\rho_{IR}) &= (0.8574, 0.9024) \end{align*} \]
Could compare with Kwon and Lee (2018).
Rarely focused on. One example is Berg (2007).
(Partial) Association between size of the firm and distress is non-constant through time.
Weak evidence for partial industry effects.
Document importance. Could make model for \((\vec x_{it}, \vec z_t, \vec o_{it})\) as in Duffie, Saita, and Wang (2007) and Duffie et al. (2009). This may lead to poor performance as documented in Duan, Sun, and Wang (2012).
Alternatively, forgo the the covariate prediction problem and make a 1 year distress model.
Explore other time-varying coefficients.
Potential issue with Moody’s data.
Azizpour, S, K. Giesecke, and G. Schwenkler. 2018. “Exploring the Sources of Default Clustering.” Journal of Financial Economics 129 (1): 154–83. doi:https://doi.org/10.1016/j.jfineco.2018.04.008.
Basel Committee on Banking Supervision. 2006. “International Convergence of Capital Measurement and Capital Standards: A Revised Framework, (Comprehensive Version).” Bank for International Settlements.
Beaver, William H., Maureen F. McNichols, and Jung-Wu Rhie. 2005. “Have Financial Statements Become Less Informative? Evidence from the Ability of Financial Ratios to Predict Bankruptcy.” Review of Accounting Studies 10 (1): 93–122. doi:10.1007/s11142-004-6341-9.
Berg, Daniel. 2007. “Bankruptcy Prediction by Generalized Additive Models.” Applied Stochastic Models in Business and Industry 23 (2). John Wiley & Sons, Ltd.: 129–43. doi:10.1002/asmb.658.
Bharath, Sreedhar T., and Tyler Shumway. 2008. “Forecasting Default with the Merton Distance to Default Model.” The Review of Financial Studies 21 (3): 1339–69. doi:10.1093/rfs/hhn044.
Campbell, Jens, John Y. And Hilscher, and Jan Szilagyi. 2008. “In Search of Distress Risk.” The Journal of Finance 63 (6). Blackwell Publishing Inc: 2899–2939. doi:10.1111/j.1540-6261.2008.01416.x.
Chava, Sudheer, and Robert A. Jarrow. 2004. “Bankruptcy Prediction with Industry Effects *.” Review of Finance 8 (4): 537–69. doi:10.1093/rof/8.4.537.
Chava, Sudheer, Catalina Stefanescu, and Stuart Turnbull. 2011. “Modeling the Loss Distribution.” Management Science 57 (7): 1267–87. doi:10.1287/mnsc.1110.1345.
Dempster, A. P., N. M. Laird, and D. B. Rubin. 1977. “Maximum Likelihood from Incomplete Data via the Em Algorithm.” Journal of the Royal Statistical Society. Series B (Methodological) 39 (1). [Royal Statistical Society, Wiley]: 1–38. http://www.jstor.org/stable/2984875.
Duan, Jin-Chuan, Jie Sun, and Tao Wang. 2012. “Multiperiod Corporate Default Prediction—A Forward Intensity Approach.” Journal of Econometrics 170 (1): 191–209. doi:https://doi.org/10.1016/j.jeconom.2012.05.002.
Duffie, Darrell, Andreas Eckner, Guillaume Horel, and Leandro Saita. 2009. “Frailty Correlated Default.” The Journal of Finance 64 (5). Blackwell Publishing Inc: 2089–2123. doi:10.1111/j.1540-6261.2009.01495.x.
Duffie, Darrell, Leandro Saita, and Ke Wang. 2007. “Multi-Period Corporate Default Prediction with Stochastic Covariates.” Journal of Financial Economics 83 (3): 635–65. doi:https://doi.org/10.1016/j.jfineco.2005.10.011.
Fearnhead, Paul, David Wyncoll, and Jonathan Tawn. 2010. “A Sequential Smoothing Algorithm with Linear Computational Cost.” Biometrika 97 (2). [Oxford University Press, Biometrika Trust]: 447–64. http://www.jstor.org/stable/25734097.
Gordon, N. J., D. J. Salmond, and A. F. M. Smith. 1993. “Novel Approach to Nonlinear/Non-Gaussian Bayesian State Estimation.” IEE Proceedings F - Radar and Signal Processing 140 (2): 107–13. doi:10.1049/ip-f-2.1993.0015.
Hwang, Ruey-Ching. 2012. “A Varying-Coefficient Default Model.” International Journal of Forecasting 28 (3): 675–88. doi:https://doi.org/10.1016/j.ijforecast.2011.11.006.
Jensen, Thais, David Lando, and Mamdouh Medhat. 2017. “Cyclicality and Firm-Size in Private Firm Defaults.” International Journal of Central Banking 13 (4): 97–145.
Koopman, Siem Jan, André Lucas, and Bernd Schwaab. 2011. “Modeling Frailty-Correlated Defaults Using Many Macroeconomic Covariates.” Journal of Econometrics 162 (2): 312–25. doi:https://doi.org/10.1016/j.jeconom.2011.02.003.
Kwon, Tae Yeon, and Yoonjung Lee. 2018. “Industry Specific Defaults.” Journal of Empirical Finance 45: 45–58. doi:https://doi.org/10.1016/j.jempfin.2017.10.002.
Lando, David, Mamdouh Medhat, Mads Stenbo Nielsen, and Søren Feodor Nielsen. 2013. “Additive Intensity Regression Models in Corporate Default Analysis.” Journal of Financial Econometrics 11 (3): 443–85. doi:10.1093/jjfinec/nbs018.
Meng, Xiao-Li, and Donald B. Rubin. 1993. “Maximum Likelihood Estimation via the Ecm Algorithm: A General Framework.” Biometrika 80 (2). [Oxford University Press, Biometrika Trust]: 267–78. http://www.jstor.org/stable/2337198.
Nickerson, Jordan, and John M. Griffin. 2017. “Debt Correlations in the Wake of the Financial Crisis: What Are Appropriate Default Correlations for Structured Products?” Journal of Financial Economics 125 (3): 454–74. doi:https://doi.org/10.1016/j.jfineco.2017.06.011.
Pitt, Michael K., and Neil Shephard. 1999. “Filtering via Simulation: Auxiliary Particle Filters.” Journal of the American Statistical Association 94 (446). [American Statistical Association, Taylor & Francis, Ltd.]: 590–99. http://www.jstor.org/stable/2670179.
Qi, Min, Xiaofei Zhang, and Xinlei Zhao. 2014. “Unobserved Systematic Risk Factor and Default Prediction.” Journal of Banking & Finance 49: 216–27. doi:https://doi.org/10.1016/j.jbankfin.2014.09.009.
Shumway, Tyler. 2001. “Forecasting Bankruptcy More Accurately: A Simple Hazard Model.” The Journal of Business 74 (1). The University of Chicago Press: 101–24. http://www.jstor.org/stable/10.1086/209665.
Vassalou, Maria, and Yuhang Xing. 2004. “Default Risk in Equity Returns.” The Journal of Finance 59 (2). [American Finance Association, Wiley]: 831–68. http://www.jstor.org/stable/3694915.
Wood, S.N. 2017. Generalized Additive Models: An Introduction with R. 2nd ed. Chapman; Hall/CRC.